Reducing the Ising model to matchings 



Version: July 2, 2009 
Mark L. Huber 

Department of Mathematics and Computer Science, Claremont McKenna College 
mhuber@cmc.edu 

Jenny Law 

Department of Mathematics, Duke University 
waijlaw@math . duke, edu 



Abstract 

Canonical paths is one of the most powerful tools available to show that a Markov 
chain is rapidly mixing, thereby enabling approximate sampling from complex high 
dimensional distributions. Two success stories for the canonical paths method are 
chains for drawing matchings in a graph, and a chain for a version of the Ising model 
called the subgraphs world. In this paper, it is shown that a subgraphs world draw 
can be obtained by taking a draw from matchings on a graph that is linear in the size 
of the original graph. This provides a partial answer to why canonical paths works 
so well for both problems, as well as providing a new source of algorithms for the 
Ising model. For instance, this new reduction immediately yields a fully polynomial 
time approximation scheme for the Ising model on a bounded degree graph when the 
magnitization is bounded away from 0. 

Keywords: Monte Carlo, simulation reduction, canonical paths, fpras 



1 Introduction 

The Markov chain Monte Carlo (MCMC) approach remains the most widely used method- 
ology for generating random variates from high dimensional distributions. Let 7r be a distri- 
bution on a finite state space fl A Markov chain is a stochastic process {Xi, X 2 , . . .} on Q 
so that P(X t+1 G A\X U X 2 , . . . , X t ) = P(X m G A\X t ). 

Call a chain on a finite state space ergodic if there exists an N such that for all n > N 
there is positive probability of traveling from any state x to any other state y in n steps. For 
ergodic chains, the limiting distribution of X t will equal the stationary distribution of the 
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chain. Using well known methodologies, it is straightforward to build Markov chains whose 
stationary distribution matches a target distribution tt. (See [S] for more details.) 

One ingredient is missing: the question of how large t must be before the distribution of 
X t is close to 7r in some sense such as total variation. This t is known as the mixing time 
of a Markov chain, and unless it can be found for the chain in question, MCMC remains a 
heuristic rather than an algorithm for approximate sampling. 

A breakthrough occurred when Jerrum and Sinclair developed the ideas of conductance 
and canonical paths into tools capable of proving the mixing time for complex chains on 
high dimensional spaces. In [9] , they utilized conductance to show that a chain of Broder [3] 
for generating uniformly from the perfect matchings of a graph was rapidly mixing under a 
condition that encompassed a range of interesting problems such as uniform generation of 
regular graphs [TO] . 

The development of canonical paths for the Ising model [TTJ followed. The use of ap- 
proximate samples derived from a Markov chain together with selfreducibility [13] yields an 
approximation for the partition function of the Ising model, and this is still the only fully 
polynomial time randomized approximation scheme (fpras) known for this problem. 

Later uses of canonical paths included an extension from perfect matchings to all match- 
ings [12J and an algorithm for finding perfect matchings in polynomial time in all graphs 

(MM)- 

Canonical path approaches have also been used on such varied problems such as choosing 
approximately uniformly from a convex set [5] and 0-1 Knapsack solutions [T7] . 

This work is a step towards understanding the relationship between some of these prob- 
lems. We show 

• Sampling from the subgraphs Ising model with zero magnitization can be accomplished 
by generating a perfect matching in a graph linear in the size of the original graph. 

• Sampling from the subgraphs Ising model with positive magnitization on a graph can 
be accomplished by generating a matching in a graph linear in the size of the original 
graph. 

• Sampling from perfect matchings in unbalanced bipartite graphs can be reduced to 
sampling from perfect matchings in balanced bipartite graphs. 

• Sampling from matchings in a bipartite graph can be reduced to sampling from perfect 
matchings in a bipartite graph. 

Matchings in unbalanced bipartite graphs arise in the approximation of the permanent of 
rectangular matrices, used in bounding the performance of digital mobile radio systems 
(see [19J.) While conductance could be used directly to bound the mixing time of an appro- 
priate chain for this problem, it is easier to just utilize a sampling reduction. 

The remainder of the paper is organized as follows. Section [2] describes the various 
distributions that will be studied. Section [3] gives the general form of reduction by adding 
edges and nodes that will be used throughout. Section 0] details the reduction from the 
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subgraphs world for Ising to the matching distribution. Section [5] presents the reduction 
from unbalanced bipartite graphs to balanced bipartite graphs while Section gives the 
reduction from matchings to perfect matchings in bipartite graphs. Finally Section [7] notes 
how this reduction leads to deterministic bounds on the Ising partition function for large 
magnitization problems. 



2 The models 

This section describes the three models that will appear throughout the paper: The Ising 
model, weighted matchings in graphs (the monomer-dimer model in physics), and weighted 
perfect matchings in graphs (the dimer model). Each of these three problems is presented 
in terms of a weight function w(x) where x is a configuration in state space Q. Create 
a distribution n by setting 7r(x) = w(x)/Z, where Z = ^2 yen w(y) is called the partition 
function. 



The Ising model Originally a model of magnetism, the Ising model has a long history of 
study because of the existence of a phase transition on two dimensional lattices (see |18j.) 
Three different formulations of this model use spins, subgraphs, and random clusters. While 
the spins formulation is the most widely known, the subgraphs formulation will be of most 
use here (as was also the case in |llj.) 

Given a graph G = (V,E), the state space for the subgraphs world is fl su b s = {0, 1} E , 
so that each state, also known as a configuration, indexes a subgraph of G. Like many 
distributions of interest, the subgraphs world is given as a nonnegative weight function over 
configurations that is normalized by the partition function. 

The parameters for the subgraphs world are as follows. Each edge e has weight A(e) G 
[0, 1] that controls the strength of interaction between the endpoints of the nodes, and each 
node % has weight fi(i) G [0, 1] that controls the strength of the magnetic field for that node. 
Given the A and /i vectors, the weight of a configuration is: 



w SVLhs (x) 



n *<< 



e:x(e)=l 



n 



i:deg(i) is odd under x 



(1) 



1 such 



Here deg(i) under x is Ylj-{ij}eE x (j)i and counts the number of edges with x(e) 
that one endpoint of e is i. As usual, the empty product is taken to be 1. 

The vectors A and n are in a one to one correspondence with the inverse temperature and 
magnetic field parameters found in the more common spins worlds formulation. Moreover, 
the normalizing constant for the subgraphs world Z su i, s is a multiple of the normalizing 
constant for the spins world, and the multiple can be calculated explicitly (see |llj.) 

Calculation of the partition function Z SU \ )S was shown in [TT] to be a #P complete problem 
for general graphs, and so it is unlikely that a polynomial time algorithm for finding Z S ubs 
exactly will be found. 
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Matchings A matching in a graph is a collection of edges such that no two edges share an 
adjacent node. As before, a collection of edges can be indexed using x(e) = 1 if the edge is 
in the collection, and x(e) = if it is not. Using the weight formulation from before, 



Wmat(z) 



n m. 



e:x(e)=l 



n o 



i:deg(i)>2 under x 



As with Ising above, the empty product is taken to be 1, and so the only configurations with 
positive weight have no nodes with degree greater than 2. 

There exists an algorithm for computing Z mat when the graph G is planar ([7J [T5], |2T]). 
but for general graphs the problem is #P complete (see [T2].) 



Perfect matchings A perfect matching in a general graph is a collection of edges such that 
every node is adjacent to exactly one edge. That is, 



^permat y^) 



n a w 



e:x(e)=l 



n o 



i:deg(i)^l under x 



The partition function is also known as the hafnian of a matrix where the entry is 

A({i,7'}). 

Now consider a bipartite graph Gb = (Vi U V 2 ,E), where $V\ < j^V-i- Since the graph 
is bipartite {i,j} in E means that exactly one of % and j is in Vi, and exactly one is in V2. 
Here a perfect matching will mean that every node in the smaller partition Vi is adjacent to 
exactly one node. This definition is designed to fit with the definition of the permanent for 
rectangular matrices found in [19|. 



^bipermat y^) 



n *<< 



e:x(e)=l 



n 



iGVi:deg(i)^l under x 



The partition function is also known as the permanent of a matrix where the entry is 
the weight of an edge from the ith node of Vi to the jth node of V 2 - 

The permanent problem (and the more general problem of finding the hafnian) is a j^P 
complete problem, even under the restrictive condition that A(e) G {0, 1} for all e G E [12J. 



2.1 Canonical paths results 

The presentation here follows that of [12J. Recall that the total variation distance between 
two distributions /x and tt is distTv(A i ) 7r ) = su Pa IM^) - tt(^4)|- In a canonical paths argument, 
for any two configurations x, y, a path x = Xq,xi, . . . ,Xk = y is fixed where F(X t +i = 
Xi+i\Xt = Xi) > 0. For a set T of paths, let j(x,y) denote the path from state x to state y. 
Given T, the parameter p measures how often the paths use a particular move from X{ to 
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A Markov chain is reversible with respect to 7r if for all states i and j, 

ir(i)F{X t+1 = j\X t = i)= n(j)F(X t+1 = i\X t = j). 

If a chain is reversible with respect to a distribution tt, then it must be a stationary dis- 
tribution. For such a reversible chain, let Q(e) = Q({i,j}) = 7r(i)P(X t+ i = j\X t = i). 
Then 

p = p(r) :=max — — V 7r(ar)7r(y)length(7(ar,j/)). (2) 

A relationship between the mixing time of reversible chains and canonical paths was shown 
by Diaconis and Stroock [I]. The following form of the theorem is from |12j : 

Theorem 1. Let Ai be a finite, reversible, ergodic Markov chain with F(X t +i = i\X t — i) > 
1/2 for all i, and canonical paths T. Fix x G Q and e > 0. Then for all t > r e (x,T) : = 
p(r)(ln7r(x) _1 + lne -1 ) and any A C Q: 

\F(X t eA\X = x)-n(A)\<e. 

The number of steps necessary for the total variation distance to fall below e from a 
starting state will be referred to as the mixing time of the chain. The theorem states that 
T e (x, T) is an upper bound on the mixing time. Starting at x, this upper bound is proportional 
to the loading p for the set of canonical paths T. Therefore, it is important to find paths 
that keep the use of any one edge as low as possible. 

Now the results for the subgraphs world and matching can be stated. In [TT], it was 
shown for the subgraphs world that 

p(r)<2(#£) 2 /min/i(z) 4 . 

The configuration with x(e) =0 for all e has weight 1. All configurations have weight 
at most 1 and there are at most 2# E configurations, so 7r(0) > 2~# E , and starting from the 
empty configuration the mixing time is bounded above by 

2(#£) 2 (maxMz)- 4 )[(ln2)#£ + Ine" 1 ]. (3) 

i 

In [12], it was shown for the matching problem that there exist paths where 

p(T) < 4(#£)(#V)A' 2 , A' := max{l, A( ei ), . . . , X(e #E )} 

It is possible to find the maximum weight matching in polynomial time via Edmonds 
algorithm [6] . There are at most 2* s matchings, and so starting from this maximum weight 
matching the mixing time is at most 

4(#E)(#F)A' 2 [(ln2)#E + ln6- 1 ]. (4) 

For perfect matchings in general graphs, no polynomial time algorithm is known. How- 
ever, for bipartite graphs, in a landmark paper Jerrum, Sinclair, and Vigoda [TT] showed 



5 



how with a sequence of Markov chains, it was possible to obtain approximately drawn 
variates in polynomial time. The running time of this procedure was later improved to 
6(#V 7 ln 4 #^) 0. 

In the the remainder of the paper, it is shown how to obtain draws from the subgraphs 
world distribution by generating draws from the matching distribution. The running time 
using the reduction will be of the same order as a direct approach using canonical paths. 

3 Reductions by adding edges and nodes 

All of the reductions presented here have the same form. Given graph G = (V, E), consider 
drawing from a distribution tt on {0, 1} E . First construct a new graph G' = (V, E\ U E 2 ), 
where is a one to one correspondence from E to E\. Then draw X' from tt' on G', and 
let X(E) = X'((j)(E)). Then the following lemma gives a sufficient condition for X to be a 
draw from tt. 

Theorem 2. Suppose that tt(x) = w(x)/Z where Z = ^2 y w(y) when used on G = (V,E), 
and ir'(x) = w'(x)/Z' where Z' = ^2 y w(y) over @' = (X -^i U E 2 ), an d 4> is a one to one 
correspondence from the edges of E to E\ . Suppose that the weight functions satisfy: 

™V) = w(x)C, (5) 

x':x'(<f>(E))=x(E) 

where C is a constant. If (TJJ) holds, X' ~ tt' and X(E) = X'((p(E)), then X ~ tt. 
Proof. Fix x G {0, 1} E , and note 

P(X = x) = p ( x = X \ X> = x'MX' = x')= w '( x ') l Z '- 

x' x':x'{<t>(E))=x(E) 

But by assumption, the numerator of the right hand side is just Cw(x), so ¥(X = x) = 
w(x)C/Z', so X ~ tt and Z = Z'/C. □ 

Reduction for subgraphs to maximum degree 3 This theorem can be applied to reduce 
any subgraphs world problem to one where the maximum degree is three. Suppose that 
« is a node with degree greater than 3 in G = (V,E). Let ji, j 2 , . . . , jdeg(i) denote the 
neighbors of i. Then consider the subgraphs distribution on a new graph G' = (V, E') 
where V = (V \ {i}) U {h,i 2 }, and 

E' = [E U {h,i 2 } U {{ii,ji}, j 2 }} U {{i 2 , j 3 }, {i^u}, • • • , {«2, jd eg (i)}}) 
\U j:{iJ}eE {{i,j}}. 

In other words, node % is being split into two nodes, %\ and i 2 . Node %\ is connected to 
the first two neighbors of i, while i 2 is connected to the remaining neighbors of i. Finally, 
the nodes i\ and i 2 are connected. Figure [1] illustrates this process. 
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Figure 1: Splitting a node 

The function is the identity map for any edge that is not adjacent to i. Edges 
and {i, j'2} rnap to and {11,32} respectively, while {i,jk} maps to {«2, Jfc} for k G 

{3, ...,deg(z)}. 

The point of splitting i is that node i\ now has degree 3, and deg(i 2 ) = deg(z) — 1. 
Repeated deg(z) — 3 times, the nodes that result from splitting i all have degree 3. This can 
be repeated for every node of the graph with degree greater than 3 until none remain. The 
following lemma verifies that this is a valid reduction. 

Lemma 1. Suppose G = (V,E) and G' = {V',E') are as described above. Set \'(<f)(E)) = 
\{E), n\V \ {t}) = n{V \ {1}), A'({ii, i 2 }) = 1, and 

H'(i x ) = n'{i 2 ) = - ^/fi(i)- 2 - 1, 

when /i(z) > ; and = ^'(22) = otherwise. Then drawing X' from 7r su b s on G' and 

setting X(E) = X'((f)(E)) yields X ~ 7r su b s on G. 

Proof. Fix x G {0, 1} E . In light of Theorem [2j it suffices to show 

\J w(x') = w(x)C, (6) 

x':x'{<t>(E))=x(E) 

where w(-) = w S ubs( - )- There is only one edge in E' not in 4>(E): the edge {ii,^}- There are 
two values for this edge, and so the sum consists of two terms. 

The first case to consider is when i has odd degree in x. When = the configuration 
x has probability under 7r su b s , and so no check is necessary. 

When > 0, then whether edge {11,12} is 1 or in x', exactly one of i% and 12 has 
odd degree, while the other is even. Hence a factor of //(ii) or //(i 2 ) is contributed to the 
weight, but a factor of n{i) is missing so w'(x') = for one term in the sum, 

and w'(x') = /i'(i 2 )/i(i) _1 u7(x) in the other term. Hence ([6]) becomes: 

+ n(i)^ 1 fi'(i2)w(x) = w(x)C. (7) 
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On the other hand, when % has even degree, one choice of x\{ii, 12}) leads to both i x and 
%2 having odd degree, while with the other choice both have even degree. Hence (E]) becomes: 

w(x) + ji fi \i2)w *(x) = w(x)C. (8) 

When = 0, only this equation needs to be satisfied, and so C = 1 + /u'(ii)//(i 2 ) 
applies. However, note that 

1 + //(zi)//(z 2 ) = 1 + [jiii)- 1 - VMO^-l) 2 = 2/i(0" 2 - 2//(i)-VM«) -2 - 1- 

This right hand side is exactly /x(z) _1 [//(zi) + //(z 2 )], so setting C equal to this expression 
satisfies both j7j) and (jHJ), finishing the proof. □ 

The downside of this construction is that the magnetic field is smaller at each of the 
duplicated nodes since < unless G {0, 1}, in which case they are equal. The 

good news is that once < j^V~ x it can be replaced by j^V~ x without changing the 
distribution too much. When \x{%) is this small, the chance that a draw from 7r su b s will result 
in all nodes having even degree is at least exp(— 1), so simple acceptance rejection can be 
used [TT] . 

Consider starting with a graph G = (V, E) and repeatedly applying this reduction until 
the maximum degree of the graph is 3. Then the number of new edges created at each node 
equals the degree of the node minus 3. Hence there are at most ^degfz) = 2#E edges 
created, resulting in at most 3#E total edges. 



Reduction for subgraphs to all degree 3 nodes Not only can the maximum degree be 
reduced to 3, but the degree 1 and 2 nodes can be eliminated as well. 

Lemma 2. Let G = {V,E) be a graph with degree 1 node i and edge Create G' = 

(V, E') byV = V\{i}, E' = E\{i,j}. Say X' ~ 7r subs onG' with parameters X'(E') = X(E), 

H , (V'\{j})=l*{V'\{j}), ^ 

fi{3> i+A({z,i}v(zvor 

Let X(E') = X'(E'). Given X(E'), let X({i,j}) ~ Bern(5), where 5 is 6 1 if deg(j) is odd 
under X' , and 62 if deg(j) is even under X' where 

r = c A({z,j})^(i)/i(j) 

1 Kj) + WJWY 2 i + A({*,j})M»>(jr 

Then X ~ 7r su b s with parameters A and \i. 

Proof. Intuitively, a degree 1 node with no magnetic field is irrelevant to the rest of draw 
since the edge will always be off. In this case b~\ = 62 = 0, so the edge is always removed. 
A degree 1 node with magnetic field is linked only to its nearest neighbor, so it makes that 
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neighbor a little more likely to be spin up, which is the same as increasing its magnetic field. 
The amount of increase depends on the magnetic field of % and the strength of interaction 
to j. 

Fix x G {0,1} E . Let r(x) = H eeE , :x{e)=1 A(e) Ukev'\{j}:d eg (k) is odd in x M*0 ■ Th en there is 
a unique x' G {0, 1} E that matches x on E', and the following table summarizes the proba- 
bilities needed: 



j under x' w suhs (x') x{{i,j}) w suhs {x) 

odd r{x)n'{j) 1 r(x)\({i,j})n(i) 

odd r(x)fj, , (j) r(x)fx(J) 

even r(x) 1 r(x)A({i, 

even r(x) r(x) 

Note P(X = x) = F(X'(E') = x(E'))¥(X({i, j}) = x({i,j}). Let 6 1 = F(X({i,j}) = 
l\j is odd in X') and 5 2 = F(X({i, j}) = l\j is even in X'). Requiring that this probability 
is proportional to w su b s (x) together with the above table gives rise to four equations (C is 
the common constant of proportionality): 

r(sy(j)*i = r(x)\({i,j})n(i)C 

r(x)M)(l-6i) = r{x)n{j)C 

r( x )5 2 = r{x)\{{i,j})n(i)n{j)C 

r(x)(l-8 2 ) = r(x)C 

The r(x) factor cancels out from both sides, and adding the last two equations yields C = 

(1 + A({z,j})^(i)A*0")) _1 and then ^ = \({iJ})pi(i)pi(j)C. 

Adding the first two equations yields //(j?) = [fJ>(j) + \({i,j})(J,(i)]C, which in turn gives 
5\ = + j}) nil))' 1 . Since [J,'(i), Si, and S 2 are in [0, 1], this completes 

the proof. □ 

Removing degree 2 nodes can be accomplished as well by taking these nodes and splitting 
them into two copies connected by an edge, making the degree of each copy equal to 3. The 
details are presented in the appendix. 



4 Reduction of Ising to matchings and perfect matchings 

In this section the technique of the previous section will be used to reduce the subgraphs 
world with no magnetic field to a perfect matching problem, and to reduce the subgraphs 
world with positive magnetic field to a matching problem. 

Reduction to perfect matchings when the magnetic field is zero When the magnetic 
field is zero (so fi(i) = for all i), the subgraphs world model can be reduced to a problem 
where every degree is 3 and all n(i) remain at 0. It is from this graph that the graph for 
perfect matchings will be constructed. 
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Each edge {i, j} in the original graph is given two new nodes and Vji. Connect these 
two nodes by an edge of weight j})- Each node i in the original graph is given a new 
node di. Connect di and with an edge of weight 1 for all j. 

Finally, connect nodes of the form Vij, Vik for all j and k with an edge with weight 1/3. 
Set 4>({i,j}) = {vij,Vji}. Figure [2] illustrates a piece of this transformation. 




Figure 2: New graph for degree 3 nodes 

For a graph (V, E) where every node has degree 3, this transformation creates four nodes 
for each original node. These four nodes are connected by 6 new edges, and the original 
edges still remain. Hence after modification, the new graph has nodes and j^E + 6#V 
edges. 

Lemma 3. Given a graph G = (V, E) with all nodes of degree 3, identically and edge 
weights X, let n(v,l), n(v,2) and n(v,3) denote the neighbors of node v. For each i, set 
Ai = {di, Ui, n (i,i), Uj, n (i,2), ^i,n(i,3)} ; an d E{ be all subsets of size 2 in Ai so that (Ai,Ei) is the 
complete graph on 4 vertices. Set 

V = U ieV Ai 

E' = (u {iij}eE {{ Vij ,v 3i }}) U(UiEi). 

For edges of the form e = {vij,Vji}, set A'(e) = j}). For edges e = {di,Vij}, set 
A'(e) = \, and for edges e = {vij,Vik}, set A'(e) = 1/3. 

Then if X' is drawn from 7r permat on G' with X' , and X(E) = X'((fi(E)), then X ~ 7r subs 
on G with X. 

Proof. As before, let j}) = Vji}. Fix a configuration x in the subgraphs world, and 
consider the number of x' such that x'(<ft(E)) = x(E). The edges that are free in such an 
x' are of the form {g^,^}, {t^-, t> ifc }, or {v ij, Vji}. Give weight 1 to all edges of the form 
{di, v^}, weight 1/3 to all edges of the form {v^, Vik}, and weight X({i, j}) to edges {v^, Vji}. 
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Then for x, the choice of x is a choice for how to fill out x(Ei) for each i. For a collection 
of edges F and a configuration x', let p(x'(F)) be the product of the edge weights over edges 
with value 1 in x', that is: 

p(af))= n w 

eeF:x'(e)=l 

Let .M be the set of configurations that correspond to a matching, so x' E A4 says that if e\ 
and e 2 are edges that share an endpoint, then either x'[t\) or a/(e 2 ) is 0. Using this notation 

Wpermat (•£ 

) = p(x'(4>(E))) Hpix'iE^lix' G M). 

i 

So 

^Pcrmat(x')= ^ p(x'(<P(E) ) ) J] ) 1 ( X ' G M ) . 

x':x / (^(£))=a;(£) x':x'(tf>{E))=x{E) i 

Since there is no magnetic field, p(x'(<p(E))) = w su t, s (x), and factoring the sum of the 
products yields: 

} u w peTmat {x') = w Bnhs (x) Yl p(x'(E i ))l(a; / G M). 

x':x'(</>(E))=x(E) i x'(Ei) 

To satisfy Theorem[2l it suffices that J2 x >r E A p(x'(Ei))l(x' G M) = 1 for all i. There are 
several cases to consider based on the degree of i under x. 

Suppose first the degree of i under a; is 0. There are three different x'(Ei) with nonzero 
weight. The three different x' come from the fact that dj is matched to one of three different 
nodes. Say x'({di,Vij}) = 1. Then to match k and £, x'({vik,va}) = 1, and all the rest of 
the edges e G Ei have x'(e) = 0. Since A'({<ij, v^) = 1 and \'({vik, v^}) = 1/3, p(x') = 1/3. 

Similarly, d{ could be matched to or vn, again resulting in p(x'(E i )) = 1/3. Hence 
J2 x , {Ei) piAE^lix' G M) — 1/3 + 1/3 + 1/3 = 1. 

Now suppose degree of % under x is 1. Then x'(Ei) needs to be a perfect matching on 
4—1 = 3 nodes, which is impossible. Similarly, if the degree of i under x is 3 . x (Ei) needs 
to be a perfect matching on 1 node: also impossible. Hence in these cases the sum of the 
weights is 0. Fortunately, since there is no magnetic field, w su b s (x) = is these cases also. 

Last, suppose that the degree of % in x is 2. Then di must be matched in x'(Ei) to 
whatever node adjacent to i is not already matching. The edge weight of this edge is 1, so 
^ 3 ,,( E .)P(a; / (£^i))l(a; / G M.) = 1, which completes the proof. □ 

Note, a similar gadget appears in [16] (pp. 125-147) in the specific case of the Ising 
model on two-dimensional lattices. 

Reduction to matchings for nonzero magnetic field When the magnetic field is positive 
for every node, the subgraphs world can be reduced to sampling matchings rather than 
perfect matchings. As noted earlier, when fi(i) < #V~ l , changing fi(i) to j^V~ x results in 
a distribution that when sampled from, yields a draw from the distribution for the original 
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with probability at least exp(— 1). Hence any subgraphs model can be altered to have 
iKi) > W 1 for all i. 

The G' constructed from G remains the same as in the no magnetic field case, all that 
changes is the construction of A' for the matchings. 

All edges of the form {t> ia , v^} receive the same edge weight Ai. Edges of the form {di, Vi a } 
receive edge weight A2. But this does not leave enough freedom to handle configurations 
where the degree is either 0, 1, 2, or 3, which gives rise to a system of four equations via 
([()]). One more parameter is the constant C in the equations, but that still only gives three 
unknowns and four equations. 

So another parameter a(i) must be added to each node. The new edge weights will be 
\'({vij,Vji}) = a(i)a(j)\({i, j}) , and now there are four unknowns. The solution to the four 
equations is presented in the following lemma. 

Lemma 4. Given graph G = (V,E), with maximum degree 3, build G' = (V',E') as in 
Lemma[3i let G' = (V, E') be the same as for the zero magnetic field case. For i of degree 
3 in G, let A 2 (i) be the smallest nonnegative solution to the cubic equation 

/i(z) 2 (l + A 2 (z)) 3 = 1 + 3A 2 (z) + 3A 2 («) 2 . 

In fact, there always exists a solution with < A 2 (i) < 3/i(z)~ 2 . Let 

Ai(i) = A 2 (*) 2 (l + A 2 (*))-\ a{i) = fi(i)(l + A 2 (i)). 

Then for each edge e of the form {vij,Vik}, set A'(e) = Xi(i), while for edges e of the 
form {di,Vij}, set A'(e) = A 2 (i). For edges e = with 0(e) = set A'(0(e)) = 

X(e)a(i)a(j). 

Let X' ~ vr mat , and set X(E) = X'((p(E)). Then X ~ vr su b s . 
Proof. Fix x G {0, 1} E . As before, for F C E' let p{x'(F)) = ]\ e&F:x , {e)=1 A'(e), so 

^P-mat (X') = ^ P(A<I>(E) ) ) ]J p(x' (Ei) ) 1 ( X ' G M ) . 

x>:x'((j>(E))=x(E) x':x'{4>{E))=x{E) i 

Let d(i,x) denote the number of edges adjacent to i that have value 1 in configuration x. 
Since each edge {vtj,Vji} receives an extra factor of a(i) and a(j) in its edge weight, 



p(x'((j>(E))) = w suhs (x 



n 

i 

w suhs (x)Y\_f{i 



a(i) d{i > x) 



n ^i)- 1 

i:d(i,x) is odd 



where 



f(i) := a{i) d{Lx) [l(d(i } x) is even) is odd)]. 
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Then 

w pcrmat (x') = w suhs (x) Y[f(i) Y p(x\Ei))l(x' E M). 

x':x'(<t>(E))=x(E) i x'(E x ) 

In order to prove the theorem, it suffices to have each term in the product in the right 
hand side equal a constant for each i. Call this constant C{i). Fix i. There are four possible 
values for d(i,x): 0, 1, 2, and 3. Each gives rise to an equation. In the equations, Ai is 
the weight for edges between nodes in {v ij, Vik, va}, and A2 is the weight given to all edges 
leaving di. (For simplicity, the dependence of Ai, A2, C and a on i is suppressed.) 



d(i, x) 


= : 


c = 


[l + 3Ai + 3A 2 + 3AiA 2 


d(i, x) 


= 1 : 


c = 


a/iii)- 1 ^ + Ai + 2A 2 ] 


d(i, x) 


= 2 : 


c = 


a 2 [l + A 2 ] 


d(i, x) 


= 3 : 


c = 





The expression in brackets on the right hand is the sum of the weights of x'(E,j) with nonzero 
weight. When d(i,x) = 3, it must be that x'(e) = for all e G Ei, so p(x'(E i )) = 1. 

When d(i, x) = 2, there are two possible matchings, either di is matched to the remaining 
node (giving weight A 2 ) or not (giving weight 1). 

When d(i,x) = 1, one of {vij,Vik,va} is matched: say without loss of generality vn is 
taken. Then the remaining nodes of Ei are {di,Vij,Vik} and they are all connected to each 
other. Hence the possible matchings are: empty matching (weight 1), matching Vij to Vik 
(weight Ai), matching di to % (weight A 2 ) and matching di to Vik (again weight A 2 ). Hence 
the sum of the weights of the matchings is 1 + Ai + 2A 2 . 

When d(i,x) = 0, any matching in Ei contributes to the sum. There is one matching 
of size (weight 1), six matchings of size 1 (three weight Ai, three weight A 2 ), and three 
matchings of size 2 (all weigh AiA 2 ). Hence the total sum of weights is 1 + 3Ai + 3A 2 + 3AiA 2 . 

Now the solution must be checked. First show existence. Let 

g(x) := /i(i) 2 (l + x) 3 - (1 + 3x + 3x 2 ). 

Note that 

#(3//(i)- 2 ) = fi(i) 2 + 8 + I8fi(iy 2 > 0, 

but g(0) = fi(i) 2 — 1 < Since g is continuous, there must be a solution to g(x) = fi(i) 2 with 
x E [0,3/i(i)~ 2 ]. 

Since a = //(«)( 1 + A 2 ), the d(i,x) = 3 equation has 

C = n(i) s (l + A,) 3 ^^)" 1 = 1 + 3A 2 + 3A 2 . 

Similarly, the d(i,x) = 2 equation has 

C = /i(*) 2 (l + A 2 ) 3 = l + 3A 2 + 3A 2 . 
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For the d(i, x) = 1 equation, using Ai = Af,/ (1 + A 2 ) yields: 

A 2 

C = (1 + A 2 )(l + —2— + 2A 2 ) = 1 + 3A 2 + 3A 2 . 
1 + A 2 

Finally, in the d(i, x) — equation, 

C = 1 + 3Ai + 3A 2 + 3AxA 2 = 1 + - — 2 — + 3A 2 + - — 2 — = 1 + 3A 2 + 3A 2 . 

1 + A 2 1 + A 2 

Hence this is a valid solution to all four equations, and by Theorem El the result follows. 

□ 

5 Reduction of perfect matchings in unbalanced graphs to balanced 
graphs 

Consider finding the permanent of a bipartite graph G = (V1UV2, -E 1 ) so (V{z, j} G E)(#({i,j}n 
Vi) = j} H V 2 ) = 1). Call the graph unbalanced if #Vi < #V 2 , and call a configuration 
x £ {0,1} E & perfect matching in an unbalanced bipartite graph if for all i G Vi, there exists 
a j G V 2 such that {i, j} G -E and x({z, j}) = 1. 

Smith studied the number of perfect matchings in unbalanced bipartite graphs to bound 
the performance of digital mobile radio systems [20J. The number of perfect matchings is 
equal to the permanent of the rectangular adjacency matrix (see [19]). 

The permanent of square matrices has attracted far more study than the rectangular 
case, and so the goal here is to reduce the problem of generating variates from unbalanced 
graphs to generating variates from balanced graphs. 

Fortunately, the reduction is easy to describe. Let V3 consist of — #Vi new nodes, 
and add edges from every node in V3 to every node in V 2 , and give them weight 1. Then G 
is still bipartite with node partition (Vi U V 3 , V 2 ). 

Consider a configuration x that was a perfect matching in the original graph. Then if x' 
is a configuration on the new graph with x'(E) = x(E), then exactly j^V\ nodes in V 2 are 
already matched. So to fill out x', the remaining nodes in V 2 must be matched to V3, and 
there are exactly (#V 2 — #Vi)! ways to accomplish this. Each of these has weight equal to 
the weight of x(E), and so J2x'-x'{e)=x(e) w '{ x> ) — ^(a ; )(#V 2 — #Vi)!. Theorem El then says 
that drawing x' from perfect matchings on the new graph and keeping x(E) = x'(E) results 
in a draw from the perfect matchings on the original unbalanced graph. 

Hence any algorithm for simulation and approximation of the partition function for per- 
fect matchings on balanced graphs (such as |H],[2]) can also be used for unbalanced problems. 

6 Reducing matchings to perfect matchings in bipartite graphs 

Consider a bipartite graph G = (Vi U V 2 , E). The problem of sampling from all matchings 
in this graph can be reduced to sampling perfect matchings as follows. First, for each node 
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Figure 3: Unbalanced to balanced bipartite graph 



i G Vi, create a node i' and add edge with edge weight 1. If V 3 is the set of i', this 

creates a new, unbalanced bipartite graph G' = (Vi U (V 2 U V 3 ),E). Furthermore, each 
matching in the original graph corresponds to a perfect matching in the new graph with 
equal weight. So a perfect matching sampled from the new graph yields a matching in the 
original. 

This can be reduced to a balanced bipartite graph as described in the previous section: 
create j^Vi new nodes and connect each of them to all of the nodes in Vi U V3. The final 
graph is still bipartite but now is balanced. 




Figure 4: Matchings to perfect matchings 

The point of this is that the new problem is a perfect matchings problem on bipartite 
graphs, and so samples can be generated in polynomial time ([HJ [2]) for any values of the 
edge weights. With the matching canonical paths approach, the time to generate a sample 
depends on the square of the largest edge weight, with this reduction, this dependence no 
longer appears. 
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7 Consequences of the Ising reduction 

The purpose of any reduction is so that existing methods for one problem can be immediately 
applied to the other. For instance, it was noted in equation ([3]) that the mixing time for the 
Ising model on graph G = (V,E) was upper bounded by 2(#£') 2 (max i /x(i)~ 4 )[(ln2)#i? + 
In e _1 ] using the canonical paths method. 

Suppose instead that the Ising model is first reduced to a graph with maximum degree 3 
with at most 3#E edges and Ij^E nodes. Then the graph is further altered so that a draw 
from the matchings distribution yields a draw from the subgraphs distribution. The mixing 
time for the matchings distribution on this graph is 4(3#E)(2#E)\' 2 [(ln2)#E + lne _1 ], and 
from the construction in Section [4] A' < 3 maxj /i(i)~ 2 . So the total mixing time is the same 
order in j^E as for direct analysis on the subgraph world, but the constant in front is larger 
by a factor of 108. 

The purpose of the reduction is so that any improvements or new algorithms for simu- 
lating matchings will automatically translate into new algorithms for the subgraphs world. 

As an example of this, Bayati et. al. [Il] have shown that for matchings in graphs of 
bounded degree and bounded edge weights how it is possible to construct a deterministic 
fully polynomial time approximation scheme for computing the partition function for the set 
of matchings. To be precise, their result states: 

Theorem 3. For a graph G = {V, E) with edge weights X, and e > 0, there exists an 
exp(e) approximation algorithm for Z = J2 xe {o,i}z-.x a matching Il«x( e )=i A ( e ) that runs m time 
0(n/e) Klog A+1 , where A is the maximum degree of the graph, X = max e A(e) and k = 
-2/log(l-2/[(l + AA) 1 /2 + i]). 

For the subgraphs distribution, after the reduction A = 3, and A < 3maxj fi(i)~ 2 . It can 
be shown this makes k less than 3.06 maxj ^(i)" 1 . 

Given the relationship between the partition function for subgraphs and matchings given 
by Lemma HI and the well-known relationship between the subgraphs world partition function 
and the Ising partition function (see [H]), this gives a fpras for the partition function of the 
Ising model for magnitization bounded away from 0. 

8 Appendix 

There are several ways to deal with degree 2 nodes. The simplest is to "clone" the node by 
replacing it with two nodes connected by an edge of weight 1, as shown in Figure El 

Lemma 5. Let G = (V,E) be a graph where node i has degree 2. Let {i,j} and {i,k} be 
the edges adjacent to i and suppose the parameters for 7r subs are given by A and ji. Construct 
G' = {V, E 1 ) by letting V = (V\ {i}) U {if, i"} and 

E' = (E\ {{i,j}, {i, k}}) U {{i',j}, {i",j}, K, k}, {i", k}, {if, i"}}. 
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Figure 5: New graph for degree 2 nodes 



Set Xi, X 2 , and /xi in [0, 1] to satisfy 



2Al KiU}), T^TJ = *({<,*}), t^5 = A*(0- (10) 



Lei A'({z',j}) = X'({i",j}) = A 1; A'({z',A;}) = X'({i", k}) = X 2 , //(«') = //(z") = Hi, and 
X'({i',i"}) = 1. For a// oi/ier edges and nodes, let X' and // egwa/ A and /i respectively. 

Draw X' ~ vr subs twins (A',/A Let X({i,j}) = [X'({i',j}) + X'({i", j})} mod2, and 
X({i,ifc}) = [X'({z',A;}) +X'({i",A;})]niod2, T/ien X ~ vr subs using (A, /x). 

Proof. Note that the equations given in ( |T0|) can be easily solved for Ai, A2 and \i\ using the 
quadratic formula. For an equation of the form 2x/(l + x 2 ) = y, the left hand side is at 
x = 0, is 1 at x = 1, and is continuous. Hence for y G [0, 1], there is a solution x G [0, 1]. 

Now to show that the output X of the procedure has the correct probability. Consider 
that if X({i,j}) = 1, then either X'({i',j}) = 1 or X'({i",j}) = 1 but not both. In either 
of the two factor of Ai is contributed, making a total contribution of 2Ai. A similar 

result holds when X({i, k}) = 1. 

When X({i,j}) = 0, either both {i',j} and {i",j} are in X' or both are 1 in X'. The 
total weight contribution is therefore 1 + X\ . 

Now consider what happens with {i',i"}. In the case that X({i,j}) = X({i, k}) = 1, one 
choice of X'({i', i"}) leads to both i' and i" being odd, and the other choice leads to both i' 
and i" being even. This makes the total weight of these contributions 1 + {if. 

This situation also arises when X({i,j}) = X({i,k}) = 0. On the other hand, when 
X({i,j}) 7^ X({i, k}), one choice of X'({i' ,i"}) will make one of {i',i"} even and the other 
odd, while the other choice flips the parity of both i' and i" . Hence the total contribution 
to weight is \i\ + 

Let the function / be the transformation that takes x' and constructs a state x. That is, 
f(x')(e) = x'(e) for all e G E\{{i,j}, {i, k}}, f(x')({i,j}) = (x'({i>, j}) +x'({i", j})) mod 2, 
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and f(x')({i, k}) = (x'({i', k}) + x'({i", k})) mod 2. In the table below, factors from edges 
and nodes that appear in both x'(E) and f(x')(E) are the same, and so are neglected. 

x ({h3}) x({i,k}) ^,,. /(x , )= ,w subs (x') w suhs (f(x')) 

(l + Af)(l + A!)(l + fx?) 1 

1 (1 + A^)(A 2 + A 2 )(//i + fii) \{{i,k})fi(i) 

1 (Ax + A 1 )(1 + A|) + A({z,j})/i(z) 

1 1 (A 1 + A 1 )(A 2 + A 2 )(1 + ^) \({i,j})\({i,k}) 

By the way Ai, A 2 and /ii are defined, when C — (1 + Af ) (1 + A^Xl + /if), the equation 

^2 w ™bs(x') = Cw suhs (x) 

x' :f(x')=x 

holds. As in the proof of Theorem [21 this implies 

F(X = x)= = x') = Cw suhs (x)/Z, 

x' :f(x')=x 

and hence X has the desired distribution. □ 

Note that after this reduction, the degree of i' and i" is 3, while the degree of j and k 
is increased by 1. After raising the degree of the other nodes, they can be split apart as in 
Section [3] if needed. 
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